Introduction

The theory that human capital in the form of education is a significant factor for economic growth is one that economists have been continuously investigating (Barro and Lee 1993). Barro and Lee state that female educational attainment has a positive, significant relationship with GDP per capita. Given that there are more males in the workforce than females, gender ratio may be an interesting predictor to explore regarding educational attainment and GDP per capita. Another interesting observation made by researchers is that children of higher cultural and social class families are more likely to gain educational credentials (Sullivan 2001). Thus, various aspects of inequality and families’ ability to afford schooling cost may be a subject to investigate regarding their effects on educational attainment. (Byun, Meece, and Irvin 2012) further discovered that rural students had more advantages in achieving a bachelor’s degree education compared to non-rural students in terms of having more social resources, even after controlling for individual family background, demographic background, and academic preparation. This shines some light on dissolving, to an extent, the complexity of the ongoing rural-nonrural disparities. Following Brun. S, et all (2012)’s observation, we thought that rural population and urban poverty would be important variables to include in our research.

The goal of our study is to contribute to the researchers’ work above and further explore the effects of the proven variables in the global level. We first discuss our method of collecting necessary variables and how we compiled our final dataset. Then, we present multiple plots (timeline plots, scatterplots, map of the world, etc) that showed any kind of noteworthy relationship with educational attainment. We also generate a shiny app where one can select a year, an explanatory variable, and view the level of primary educational attainment across the world. Lastly, we discuss some challenges that we overcame, limitations, and future directions of research.

Research Question

What are the explanatory variables that best explain cross-country level of educational attainment?

Our Data

The fundamental part of our data is from gapminder.com (“Data,” n.d.), a website with comprehensive cross-country data from various trustworthy sources, such as the World Bank (Dang et al. 2019). In order to measure the level of educational attainment, we select the primary school completion data as our response variable. A full definition and unit of each variable is summarized in the table below. We collected data for all variables and transformed each of them into a longer form with pivoting. Then, we inner joined them into the primary school variable dataset. The continent variable was created via R package and added to the primary school dataset. The decade and high GDP variables were created via mutating based on their definitions in the variable descriptions table.

Variable Description

Variable Description

World Map of All Countries in our Dataset

We have a total of 71 countries in our dataset, and the distribution of the countries are shown in the world map.

World Map from Our World in Data

World Map from Our World in Data

Our World in Data (Roser and Ortiz-Ospina 2016) actually made the same world map of their 2015 global data of primary school education attainment graph. One of our challenges was having such a small dataset, but since our dataset looks similar to that of a reliable source, we decided to keep using the primary school dataset.

Visual Analysis

Timeline Graph of Educational Attainment over Time by Continent

The plot above shows a timeline of educational attainment by continent from 1995 to 2015. Looking at the data, we see that Oceania and Europe had relatively high educational attainment levels over the two decades. The Americas increased at a steady rate, while Asia saw a drastic increase between 1999 and 2002. Educational attainment in Africa lagged other continents by a large margin in 1995. While the continent has steadily improved its educational attainment levels over the two decades, it still lagged other continents in 2015.

Shinyapp

Link to Shinyapp

We created a shiny app where someone can select a year between 1995 and 2015, which encompasses our data, and an independent variable such as GDP per capita that can help explain primary educational attainment.

Relationship between Educational Attainment and GDP per Capita

From the plot above, we can see that the relationship between educational attainment and GDP initially increases exponentially. However, after a certain point, educational attainment levels out compared with GDP per capita. This implies that at low GDP per capita levels, a marginal increase in GDP per capita leads to an increase in educational outcomes. On the other hand, at higher GDP per capita levels, an increase in GDP per capita has almost no impact on educational attainment. It is fair to assume that countries with high GDP per capita have greater disposable income to spend on education. However, after a certain point, spending more on education does not necessarily imply better outcomes, especially when educational attainment in countries with high GDP per capita levels is already pretty high.

Relationship between Educational Attainment and GDP per Capita by Continent

When we expand the graph above that shows the relationship between Educational Attainment and GDP per Capita to the continent level, we observe similar results. There is a general trend of increasing level of educational attainment as GDP per Capita increases. For low GDP per Capita countries, there is a steeper increase in educational attainment per one unit of increase in GDP per Capita compared to high GDP per Capita countries. We observe some outliers for countries in Asia with unusually high educational attainment above all the continent mean linear estimates of educational attainment, and we observe the opposite for the outlier countries in Africa.

Comparison of Educational Attainment between countries with high and low GDP per capita

The faceted plots above show the relationship between Educational attainment levels and GDP per capita by their classification (high GDP/low GDP). On the low GDP plot, we see that the average educational attainment in 1995 was lower than 50%, and by 2015 that number had risen to almost 75%. In contrast, in 1995, high GDP countries had educational attainment levels of roughly 80%, and this number increased slightly in the coming two decades. This plot confirms the findings of the Educational Attainment vs. GDP per Capita plot we saw earlier. GDP per is significant in predicting educational attainment at lower levels, but its influence as a predictor decreases as its value increases.

Comparison of Educational Attainment between countries with high and low GDP per capita by Continent

You can see that for some continents there is a distinct difference in educational attainment between high and low gdp countries, like Africa and Asia. In Europe and the Americas there is not much of a difference between the two groups.

World Map of High vs Low GDP Countries

The map above shows the classification of countries in our dataset by high GDP and low GDP. Countries with a GDP per capita over 3,995 were classified as high GDP countries and those with GDP per capitas below the 3,995 threshold were classified as low GDP countries. Based on World Bank income classifications, we chose this threshold (“New Country Classifications by Income Level: 2019-2020,” n.d.).

Relationship between Educational Attainment and Schooling Cost

The plot above shows the relationship between educational attainment and schooling cost. From the plot, we can see that the relationship between the two variables is fairly linear. This suggests that schooling cost has little impact on educational attainment. While this result may seem counterintuitive at first, on further analysis it begins to make more sense. The second plot shows us the relationship between GDP per capita and schooling cost by continent. Ignoring Europe and Oceania (since they have very few observations), we see that as GDP per capita increases, so does schooling cost. Countries with high GDP per capitas have higher schooling costs because they can afford to spend more on education. However, does not necessarily translate to better educational outcomes.

Relationship between Educational Attainment and Child Mortality Rate by Continent

We used data on child mortality as a proxy for children’s health outcomes. From the plot above, we can see that child mortality significantly affects educational attainment levels across all continents in our dataset. This indicates that children’s health outcomes are a significant predictor of educational outcomes

Relationship between Educational Attainment and Number of Workers in Family

Here we observe that most countries with a high GDP per Capita are clustered near the bottom left, meaning they have a low number of workers in the family. An interesting observation is some countries with low educational attainment has over 40 workers in a family.

Table of GDP vs. Educational Attainment by Continent

Here is a table that shows the average gdp per capita for the different continents, with no surprise that that the americas have the highest average gdp per capita while Africa has the lowest. Note that many of the high gdp countries of Europe aren’t included in this dataset, so this would drop their average gdp per capita.

Conclusion

There were many challenges we faced during the project. The first big one was scheduling, given that our group was split between two sections, and we all had very different and full schedules. However, we all made our best efforts to make time to produce a meaningful research project. Another problem we ran into was the data. Mainly all the missing values we had was because different countries reported different values for variables in different years. This is discussed above in the data section.

We created a shiny app where one can select a year between 1995 and 2015, which encompasses our data, and an independent variable that can help explain primary educational attainment. One limitation is that our shiny app is incompatible with our gender variable, and we did not have enough time to investigate the reason, so we removed the gender variable from the list of possible variables one could select.

In the future, we would like to expand the shiny app to incorporate different models and different methods to explore the different variables. We would also like to find a more complete dataset or use a proxy variable that can be an indicator for educational attainment other than primary school attainment variable so that we would be able to achieve more accurate predictors when we run regression analysis. For this project, we did not perform a regression analysis due to the nature of the class and since we wanted to utilize the data wrangling skills we learned in class. However, performing hypothesis testing to determine which variables would be the best predictors of educational attainment would be an interesting future project.

References

Barro, Robert J, and Jong-Wha Lee. 1993. “International Comparisons of Educational Attainment.” NBER. https://www.nber.org/papers/w4349.pdf.

Byun, Soo-yong, Judith L Meece, and Matthew J Irvin. 2012. “Rural-Nonrural Disparities in Postsecondary Educational Attainment Revisited.” American Educational Research Journal 49 (3). SAGE Publications Sage CA: Los Angeles, CA: 412–37.

Dang, Hai-Anh H., D. Barne, Omar Hadi, and John Baffes. 2019. “World Bank Open Data.” Data. https://data.worldbank.org/.

“Data.” n.d. Gapminder. https://www.gapminder.org/data/.

“New Country Classifications by Income Level: 2019-2020.” n.d. World Bank Blogs. http://blogs.worldbank.org/opendata/new-country-classifications-income-level-2019-2020.

Roser, Max, and Esteban Ortiz-Ospina. 2016. “Global Rise of Education.” Our World in Data. https://ourworldindata.org/global-rise-of-education.

Sullivan, Alice. 2001. “Cultural Capital and Educational Attainment.” Sociology 35 (4). Cambridge University Press: 893–912.